A Comparison of Posterior Simulation and Inference by Combining Rules for Multiple Imputation
نویسنده
چکیده
Multiple imputation is a common approach for handling missing data. It is also used by government agencies to protect confidential information in public use data files. One reason for the popularity of multiple imputation approaches is ease of use: analysts make inferences by combining point and variance estimates with simple rules. These combining rules are based on method of moments approximations to full Bayesian inference. With modern computing, however, it is as easy to perform the full Bayesian inference as it is to combine point and variance estimates. This begs the question: is there any advantage of using full Bayesian inference over multiple imputation combining rules? We use simulation studies to investigate this question. We find that, in general, the full Bayesian inference is not preferable to using the combining rules in multiple imputation for missing data. The full Bayesian inference can have advantages over the combining rules when using multiple imputation to protect confidential information. AMS Subject Classification: 62D99 and 62F15 keywords: Bayesian; confidentiality; missing; synthetic
منابع مشابه
Analysis of Variance from Multiply Imputed Data Sets
The analysis of variance is a popular method used in many scientific applications. There are standard software for handling unbalanced data due to missing values in the outcome/dependent variable. The analysis becomes difficult when the missing values are in predictors. Multiple imputation is an increasingly popular method for handling such incomplete data. This approach involves replacing the ...
متن کاملAn Empirical Comparison of Performance of the Unified Approach to Linearization of Variance Estimation after Imputation with Some Other Methods
Imputation is one of the most common methods to reduce item non_response effects. Imputation results in a complete data set, and then it is possible to use naϊve estimators. After using most of common imputation methods, mean and total (imputation estimators) are still unbiased. However their variances (imputation variances) are underestimated by naϊve variance estimators. Sampling mechanism an...
متن کاملMultiple Imputation for Causal Inference
The potential outcome framework for causal inference is fundamentally a missing data problem with a special, the so-called file-matching, pattern of missing data. Given the large body of literature on various methods for handling missing data and associated software, it will be useful to use such methods to facilitate causal inference for routine applications. This article uses the sequential r...
متن کاملAccuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)
Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...
متن کاملارزیابی صحت پیشبینی ژنومی در معماریهای مختلف ژنومی صفات کمی و آستانهای با جانهی دادههای ژنومی شبیهسازیشده، توسط روش جنگل تصادفی
Genomic selection is a promising challenge for discovering genetic variants influencing quantitative and threshold traits for improving the genetic gain and accuracy of genomic prediction in animal breeding. Since a proportion of genotypes are generally uncalled, therefore, prediction of genomic accuracy requires imputation of missing genotypes. The objectives of this study were (1) to quantify...
متن کامل